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Abstract — A bound on the maximum information transmis- 
sion rate through a cascade of Gaussian links is presented. 
The network model consists of a source node attempting 
to send a message drawn from a finite alphabet to a 
sink, through a cascade of Additive White Gaussian Noise 
links each having an input power constraint. Intermediate 
nodes are allowed to perform arbitrary encoding/decoding 
operations, but the block length and the encoding rate are 
fixed. The bound presented in this paper is fundamental and 
depends only on the design parameters namely, the network 
size, block length, transmission rate, and signal-to-noise ratio. 



I. Introduction 

Transmission of messages through a series of links 
corrupted by noise is a situation that occurs frequently 
in communication networks. When the transmission block 
length is allowed to be arbitrarily large, it is quite simple 
to show (using the data-processing inequality) that the 
maximum information transfer rate is equal to the capacity 
of the weakest link. The possibilities in the finite block 
length regime are far less clear. Past work by Niesen et 
al. in 0] and by us in (2) have addressed this question for 
the Discrete Memoryless Channel (DMC) case and the Ad- 
ditive White Gaussian Noise (AWGN) case, respectively. 
These results are asymptotic and provide scaling laws for 
the block length in terms of the number of nodes. 

In this paper, we provide a universal non-asymptotic 
bound on the maximum rate of information transfer for a 
line network consisting of a cascade of AWGN links. This 
complements and improves the asymptotic scaling results 
derived in |j2]- The bound derived here is universal in the 
following sense: 

1) While we assume that the block length and encoding 
rates are constant for all the nodes, we do not assume 
any particular structure for the channel codes and 
decision rules employed at any of the nodes. 

2) In addition, no assumption is made on the abso- 
lute/relative magnitudes of the network size and the 
block length. 

It is to be noted that the analysis in Q was found to 
be unsuitable to our requirement that the bound be non- 
asymptotic, and hence we take a totally new approach here. 

The rest of the paper is organized as follows. In Sec- 
tion |nj we introduce the notations and definitions used 
in the rest of the paper. In Section [HI] we introduce 
the network and the signal nansmission models. We then 
provide our main result followed by its derivation in 
Section IIV1 followed by a short discussion in Section [V] 



that includes a comparison of our current results in relation 
to our previous results in |Z|- 

II. Notations and Definitions 

Let 1R be the set of all real numbers and IN be the set of 
all natural numbers. Natural logarithms are assumed unless 
the base is specified. The notation || • j represents C 2 norm 
throughout. JT'mxm denotes the set of all M x M row- 
stochastic matrices, and ^mxM denotes the set of all M x 
M row-stochastic matrices whose rows are identical. 

Let N G IN denote the code length or block length of the 
transmission scheme. A code rate R > is a real number 
such that 2 NR is an integer. Let J{ = {1, 2, 3, ... , 2^} 
be the message alphabet. 

Definition 1: For a certain Po > 0, a rate R length N 
code & with power constraint Pq is an ordering of M = 
2 NR elements from Mr, called codewords, such that the 
power of any codeword is lower than P$: 

c g = (ci,c 2 ,c 3 ,...,c M ) s.t. Vw G Jt, -^||c„,|| 2 < P Q . 

Definition 2: A rate R length N decision rule Si = 
(Hi,lZ2, ■ ■ ■ ,TZm) is an ordered partition of R N of size 
M = 2 NR . 

Definition 3: The encoding function ENCV : jtfi — ► 
for a code is defined by ENGg^tu) = c w , where 
c w is the w th codeword in ^ '. 

Definition 4: The decoding function DEC^ : R — > 
for a decision rule 0£ is defined by: 

DEC.^(y)= W iffyG^ t „, 

where 1Z W is the w* 1 partition in 



Let Hq 



r « 



-, the solid angle of a iV-sphere. Here, 



r(-) is the standard gamma function given by T(z) = 
J Q t z ~ 1 e~ t dt. We also define the following functions: 

Definition 5: Let Z\, . . . , Zm be i.i.d. zero-mean unit- 
variance Gaussian random variables. Let for any 7 > 0, 
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0, otherwise. 



Also, let \/v G [0, 7r 



A (iV-l)TT — 
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We then define the following function for x G [0, Qo]: 

Q{x,N,i)±?T\g{%)>x\. (1) 



The above function is the same as Q* (•) defined and used 
by Shannon in [3|. In other words, computing Q (x, N, 7) 
gives the probability that a signal point on the power- 
constraint sphere ||x|| 2 = NPo is displaced by a noise 
vector consisting of i.i.d. zero-mean unit-variance Gaussian 
random variables in each dimension outside an infinite 
right-circular cone of solid angle x whose apex is at the 
origin and axis runs through the original signal point. Note 
that $ 7 is a random variable whose probability distribution 
function has N and 7 as parameters. The inverse cotangent 
function is assumed to have [0, n] for its range so that it is 
continuous. Noting that cot" 1 x will then be a decreasing 
function of x, we have the following remark about the 
monotonicity of the Q-function w.r.t. 7: 

Remark 1: For any 71,72 > s.t. 71 > 72 and x G 

Mo], 

Q{x,N, l2 ) > Q{x,N, ll ). 

The term g (<fr 7 ) is equal to the solid angle of the cone 
formed by rotating the line joining the origin and the 
displaced signal point about the line joining the origin and 
the original signal point as the axis. 



III. Network Model 

The line network model to be considered is given in 
Fig. Q] There are n + 1 nodes in the network identified by 
the indices {0, 1, 2, . . . , n}. The n hops in the network 
are each associated with noise variances a 2 > a 2 > 
0, 1 < i < n. In other words, the noise variances can 
be different for each link, but they are equal to or greater 
than a certain minimum Oq that is strictly positive. Nodes 
0, 1, . . . , n— 1 choose codes ^ii ■ ■ ■ , ^n-i respectively 
to transmit, and Nodes 1,2, ...,n choose decision rules 
£%\ , 2%2 , • ■ • , 8% n for reception. From here on, for the sake 
of simplicity, we let ENQ and DEC,; to denote ENC^ 
and DEC^, respectively. All the codes and decision 
rules have the same rate R and block length N. Node 
generates a random message W € with probability 
distribution pw (w) and intends to convey the same to Node 
n through the noisy multihop path in the network. Each 
node estimates the message sent by the node in the previous 
hop from its noisy observation, encodes the message as 
a codeword, and transmits the resulting codeword to the 
next hop. The codeword transmitted by Node i, for any 
< i < n - 1 is given by X, = ENC 4 (Wi), where W l is 
the estimate of the message at Node i after decoding (Note 
that Wo = W in this notation). The observation received 
by Node i, for any 1 < i < n is given by Yj, which 
follows a conditional density function that depends on the 
codeword Xj_i sent by the previous node: 



PYiix^i (y|x) = 



1 



(2ira l 2 )^ 

The above density function follows from the assumptions 
of AWGN noise and memorylessness of the channel. The 
message Wi decoded by Node i is given by Wi = 
DECj(Yj). Note that the random variable W n represents 
the message decoded by the final sink. 



IV. The main result and analysis 



The following theorem summarizes our main result. 

Theorem 1: In a line network employing any choice 
of rate R length N codes ^lj • • • > "^n-i an d rate R 
dimension N decision rules £&\,£%i, . . . ,M n , 



I ( W; W n I < NR 



1-MQ 



M 



M 



We now delve into the proof of Theorem [TJ Let 
Pw \w 1 I 3) >yj> ^ ^ denote the conditional proba- 
bilities induced by channel encoding, noisy reception, and 
decoding at the i th hop. For each hop i, let P, be the 
M x M row-stochastic matrix whose entry in row j and 
column k is ^ (k \ j). Note that the j th row in 

Pi gives the conditional probability mass function on the 
estimate Wi of the original message W at hop i, given that 
the message sent by Node i — 1 is j. Let 

n 

p^n p < 

i=l 

Then, P clearly represents the row-stochastic probability 
transition matrix between the original message W and the 
message decoded at the sink W n . The transition matrix 
P along with pw (the probability mass function of the 
original message W) together induce a joint distribution 
between W and W n . Our goal is to find an upper bound 
on I (W; Wj^j, with the constraints given in Section iLUl 
For any M x M row-stochastic matrix Q, define 
i\) (Q I pw) — T- (W; W^j, where W is a random variable 
conditionally dependent on W according to the probability 
transition matrix Q and W is drawn according to the 
distribution pw (which is the distribution of the message 
at Node 0). For simplicity, we just write ip (Q) instead of 
■ip (Q | pw) for the rest of the paper, assuming throughout 
that the specific distribution pw is used. Ultimately, our 
final bound is independent of pw- We now have 



X ( W; W„ 



(3) 



we intro- 



Before proceeding further to bound I (W; W n ^j , 
duce the following useful lemma: 

Lemma 1: For any Qi G ^MxM an ^ anv Q2 G 
^ifxJtf,^(QiQa) = 0. 

Proof: The result follows from noting that for any 
Q G ^mxm, V(Q) = and QxQ 2 € S% xM for Q 1 G 



^1 



MxM 



and Q 2 G ^ M x 



AI- 



Now for each i, consider /?,; £ [0, 1] such that 

P i = APA+/9iP/j 1 , (4) 



( 2 ) where ft = 1 - ft, V Pi £ ^ M xM and P.- G ^ 



MxM- 



In other words for each i, P 2 ; be expressed as a convex 
combination of two row-stochastic matrices, one of them 
being a steady-state matrix. From ©, 
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Fig. 1: Line Network Model. 
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where (a) follows from the convexity property of mutual 
information w.r.t. the probability transition function, and 
(b) follows from applying the data processing inequality 
to the first term and Lemma \T\ to the second term. By 
induction, we have: 

(n \ n 

l[fa U{I M ) = NRj\p i , (5) 
i=X / i=l 

where Im is the M x M identity matrix. The above 
procedure is based on a key idea developed in the proof of 
Theorem V. 1 in 0] in a different context. We have applied 
the same to facilitate a useful intermediate result given by 
dU. The remaining portion of the analysis that enables us 
to obtain the final bound involves novel steps. 

We now need to determine how each P; is to be split 
in the form given by (|4]i in an optimal manner, to obtain 
the best possible bound using this approach. Specifically, 
we need fa to be as small as possible for each i. Consider 
the following choice: 



M 



A = i-E 



k=l 



m / n ^|vK-i I j) 



1 - 



Pi 3 " 



(6) 



where Pg. .j k denotes the element on j th row and fc* 
column of the matrix Pa- . Note that this matrix consists 
of identical rows, where each entry in any row is equal 
to the smallest element in the corresponding column of 
Pi scaled by a normalizing factor. The other matrix P ( g i 
is determined by substituting these expressions for fa 
and Ps into (0). Note that the two matrices Pp i and 
Ps determined thus will be stochastic for any i, and 
that fa £ [0, 1]. Hence, we can obtain P.; as a convex 
combination of two stochastic matrices in this manner for 
any i. The following lemma shows that the value of fa 
provided in © is the best possible value for the purpose 
of the bound in ([5]). 

Lemma 2: Let Q = [Qjk] € ^MxM, Qi € 
^a/xm,Q2 G ^mxM' an d l et P £ [0j 1] be chosen such 
that Q = /3Qi + (l-/3)Q 2 . Then, /3 > min, Q jk . 

Proof: Since Q = /3Qi + (1 — /?)Q2, every element 
of the matrix (1 — /3)Q2 must be smaller than the corre- 
sponding element in Q. Consider any column k of Q 2 . 



All the elements in that column are equal to, say, q k . It 
then follows that (1 — fa)qk < Qjk for every j, and hence 
(1 — 0)qk < mirij Qjk- Summing over all k and noting 
that Ylk=i 1 k = we °b tam the desired result. ■ 
Let be the code used by Node i — 1 and let Mi 
be the decision rule used by Node i. As per the argument 
above, the optimal choice of fa for this link will be: 



M 



I- fa 



E^^WilWi-i (k | j) 



fe=i 

E 



D -||y-c|| 2 /2<r 2 



■In (2naf) N/2 



In other words, we can write 



fa 



1 - /V, {%- 



i), 



-dy. (7) 



(8) 



where for any a > 0, rate R length N code ^ ', and rate 
R dimension N decision rule ffl, 



We would like to find a lower bound on [i ai that 
depends only on the parameters N,R,Po and ctq. To do so, 
we need the following three lemmas. Lemma [3] removes 
the dependency of the bound on the choice of the decision 
rule. Lemma [4] shows that we can restrict our choice of 
codes to those having all codewords that satisfy the power 
constraint with equality. For this class of codes, Lemma [5] 
gives a bound in the desired form, depending solely on 
N, R, Pq and Co- From now on, we denote (27tctq)^~ by rj 
for brevity. 

Lemma 3: Let = (ci, Ca, . . . , Cm) be a given rate R 
length N code. Further, let M* (ff) be the decision rule 
given by {n\, TZ* 2l . . . , TZ* M ) where for 1 < i < M, 

K* = jy G ~R N | i = argmax ||y - a> \ 

Then, for any rate R dimension N decision rule ffl, 

Proof: For any code and decision rule 3£, we have: 

. „ f lly-^-ll 2 

r\li a ( c €,£%) = min / e ^ dy 



3 Jn 



"dy 



mm e 2,,^ 



"dy 



M 



(a) 



fe=l 
M 



dy 



(6) 



E 

fc=i 

M 

E 



ft, 1 . 



mm / e 



II II 2 

2^2 dy 



fc=i 



(10) 



Here, (a) and (b) follow from the definition of ^* ( c ^'): for 
lly-^ll 2 

any y G 7J^,argmin - e ^ = fc. ■ 

It is useful to note that the decision rule (&) given by 
the above lemma is the same as the (M— l) th -order Voronoi 
partitioning (called "farthest-point Voronoi partitioning", 
see Section 3.3 in 0) of R N w.r.t. "jf . 

Lemma 4: Let ? be a code satisfying j|c|| 2 < 
NPq,Vc G <r #'. Then, there exists a code such 
that Vc' G ^',||c'|| 2 = NP and /x CT fjf)) > 



Proo/:- Let = (ci,...,c M ) and let ^* ("jf) = 
, . . . , 7Z* M ) . Consider a codeword that lies strictly in- 
side the ball j|xj| 2 < NPq. If no such codeword exists, the 
statement of the lemma is trivially true with = c if. For 
the non-trivial case, we can assume that such a codeword 
exists. Let Cfc be that codeword. Consider the decision 
region TZ* ka = {y £ R N \ k = argmax,, ||y - c^H} G 
8%* (^). The region 1Z* kf) (if non-empty) is convex since 
M* ( c to) is a Voronoi tesselation of order M — 1 and since 
Voronoi cells of any order are convex regions (see Property 
OK.l in Section 3.2 of 0). Hence, there exists a unique 
point Zfe in 7Z* kg nearest to c ko . By moving the codeword 
at c ko along the line joining Cfe and z k() away from the 
latter, the distance from the codeword to every point in 
IZt is increased. We continue thus until the codeword 
is moved to the surface of the power-constraint sphere, 
at say c'fc . Let us call the resulting code c <o\. Note that 
"A {cfeol = *A {c'fcj- Now consider 
M 



w* ^* OA) = E min / e 



(c) 



k=l 
M 

E 

k=l 
M 

fe=l J 



\\y-^k\\ 
e 2^ dy 



l|y— "a- M 
2^ dy 



"dy 



> E 

fe=i 

fc^fco 



_ lly--=fc II 
e 2^ dy 



e dy 



Ef lly-f II 2 , 

mm / e 2^2 dy 
k=,^U 

k^k 



l|y-c|r 

mm / e 2^ dy 

ce% Jtz* 

= mx a [? u sr («■)) 

ivi* (ID 

Here, (c) follows from (b) in the proof of Lemma [3] (d) 
follows from the construction of c' ka so that for every 

y g ftfco'lly ~ Cfc oll - II y _ c ' fe oll' and w foll ° ws 

from Lemma [3] Note also that we have only treated the 
case where TZ k is non-empty. If on the other hand, that 
decision region was empty, we can move the codeword 
at Cfc along any arbitrary direction. For such a case 
inequality (b) becomes an equality since the integrals 
over lZ* k would be zero. From a given code c &, we can 
thus obtain a code ^1 having one more codeword on 
the surface of the power-constraint sphere, also satisfying 
\L a (<«f)) > fi a (^1)). We can repeat this 

process several times to eventually obtain a code c €' with 
all codewords on the power constraint sphere. ■ 

Lemma 5: For any rate R length N code & satisfying 
the power constraint Vc G c <f, j|cj| 2 = NPq, 

m) > Q (^—±n Ql N, ^ 

Proof: For any code ^ that satisfies the requirements 
of the lemma, the decision regions in £%* (^) consists 
of pyramids with their apex at the origin and extending 
out to infinity (see Appendix [A] for a proof). Assume 
that each of these regions {TZ%} k=1 cut out a surface 
of area H k on the unit iV-sphere centered at the origin. 
Note that for each codeword Cfe G c i corresponding to 
message k, the decision region 1Z* k contains the point — Cfe. 
Consider any term in the summation of the expression for 
fi a (^)): 



mm 



2 /2o 



(27TCT 



2\T 



-dy 



3 -||y-c fc || 2 /2<r 2 



re;: 



(27T(J 2 ) 



2\~ 



-dy. 



The right hand side of the above equation is equal to the 
probability of the event E\ that the transmitted codeword in 
R N located at c k on the sphere ||x|| 2 = NPq is displaced 
by the noise vector into a specific region lZ* k that contains 
the point — c k . Now consider the probability of the event 
E2 that the same transmitted codeword is displaced into 
the iV-dimensional circular cone C k that has its apex at the 
origin, axis running through —c k , and cutting out a surface 
of area fl k on the unit sphere centered at the origin (i.e., the 
solid angle of the iV-dimensional circular cone is fl k ). We 
claim that the probability of E\ cannot be smaller than 
the probability of E^. A proof of this claim is provided 
in Appendix [B] The probability of the event E2 is equal 
to Q (fi — &k,N, Pq/ct 2 ), from the definition of the Q- 
function in Definition [5] Hence, 



k=l 



mm 

3 Jnt 



2 /2<r 2 



(2na 2 ) 



N/2 



-dy 



M 



fe=l 



(12) 



Noting that Q is a convex function of flo — ilk (See Section 
III in |3|), we apply Jensen's inequality to (fT2l >: 

i M / p 



k=l 



since £)k=i ^fc = ^o- ■ 

We are now ready to prove Theorem Q] 

Proof of Theorem [7} Consider any link i. For the 
code c tc?i-i satisfying |jc|| 2 < NP ,Vc G ^-i, we can 
apply Lemma [4] to construct another code c ^l_ 1 such that 

Vc € ^iJIcll 2 = JVP and (X-i,^* (^-i)) > 
/V 4 We then have: 



Ma, 



?<-i,#i) > M CTi (<^_i,^*(^_i)) 
(&) 

> /V (^-1,^(^-1)) 

( c ) [M-l Pn 



A/ 



( d ) / M - 1 1 



In the above chain of equations (a), (b), and (c) follow from 
Lemma [3] Lemma [4] (as discussed above), and Lemma [5] 
respectively. Inequality (d) follows from Remark Q] since 
of > o-o- Recalling that ft = 1 - fj, ai (X-i,^) and 
applying ([Pil l to (0, we have the desired result: 



(w; Wn) 



< NR 



V. Discussion 

We had mentioned in Section Q] that the bound presented 
in the current paper improves and complements the bound 
provided by J2). In this section, we demonstrate this fact 
with a comparison plot. The bound given by Q is: 

l(W\W n ) < 2 NR (l-e~ NE ( p °/^y {1 ~ e) (15) 
where for any S > 0, 

(5 + 2) + ^(5 + 2)2 -4 



25 (S) 



|log{(5 + 2) + v /(5 + 2)2-4}. 



for asymptotically large n. The above bound decays 
with n as ^1 — e~ NE ( p °/ a °'\ for any code rate R, 
while the bound given by Theorem Q] decays as 
'l-MQ(^n ,^|)). Now, let 

log (MQ^fio.JV.S)) 



E a8 {R,S)^- lim 

JV->oo 



We now investigate how these two bounds compare when 
N is asymptotically large, by comparing the values of 
E(S) and E as (R, S) where S = P /^o- To do so we 
obtained E as (R, S) as a function of R and 5 using the 
asymptotic analysis in Q. The expression for E as (R, S) 
is given below, with a justification in Appendix ICl 



E as (R, S) = S- ^VSGcos6~\og(Gsm6). 



where 



G=- (V5cos6»+ ^4 + Scos 2 



and 9 = it — sin -1 2 _i? . Shown in Fig. [2] is a plot of 
E as (R, S)/ E(S) as a function of S, repeated for various 
R. As can be seen from the plot E as (R, S) is always 
smaller than E(S) for any R and S, thus showing that 
the bound obtained in Theorem Q] is tighter than the one 
given by dlBV The current bound is also seen to be better 
when the SNR S is not very high. 




Signal-to-Noise Ratio, S in dB 
Fig. 2: Comparison of exponents for large N. 
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Appendix A 

The Farthest-point Voronoi Tessellation for 
Points on a Sphere 

Lemma 6: Given any M, N £ M, the non-empty cells 
in the farthest-point Voronoi tessellation of Mr w.r.t. any 
set of M points on an A^-sphere of radius A > are all 
semi-infinite pyramids. 

Proof: Consider the farthest-point Voronoi tessellation 
of R w w.r.t. a code ^ = (ci,.. . , c M ) s.t.||c;|| 2 = A 2 , 1 < 
i < M. Consider any x 6 R w and assume without loss 
of generality that it is contained in TZ\, the farthest-point 
Voronoi cell for c[. In that case, we have for all i s.t. 
1 < i < M, 

|| Cl -x|| 2 > ||c 4 -x|| 2 
=> ||c 1 || 2 + ||x|| 2 -2(c 1 ,x) > ||c J || 2 + ||x|| 2 -2(c J ,x) 
=*--<ci,x) > -(c i)X ). (16) 

Consider any a > 0. We claim that ax is also contained 
in 1Z\. This can be shown to be true by applying ( TTBT i to 
the expansion of ||ci — ax|| 2 : 

j|ci-ax|| 2 = ||ci|| 2 + a 2 ||x|| 2 - 2a(ci,x) 
> ||c.i|| 2 + a 2 ||x|| 2 - 2a(cj,x) 
= He, -axj| 2 , 

for all i s.t. 1 < i < M. Hence, ax is also contained in 
TZ\ . Generalizing this, we have shown that any non-empty 
Voronoi cell that contains a point x also contains the point 
ax for any a > 0. Such a region is a semi-infinite pyramid 
by definition. ■ 

Appendix B 
Proof of the claim in Lemma[5] 

Consider the N — 1 dimensional cross-section of the 
pyramid lZ* k cut out by a sphere of radius R centered at 
the origin. This will be an arbitrary spherical polygon. The 
cross-section of the cone C k by the same sphere will be 
a spherical cap with its center at — c&. The axis of the 
cone cuts through the spherical cap at its center. The non- 
overlapping regions of such a spherical cap and a polygon 
are illustrated in Fig. [3] Since the both the cross sections 
have the same surface area R N VLk, the surface areas of the 
non-overlapping parts of both the cross-sections (indicated 
as A\ and A^ and by two different shadings in Fig. |3j 
are equal. Now, every point in the shaded region A\ on 
the polygon is nearer to Cfe than any point in A2 is to c^. 
This is because the former lies outside the spherical cap 
centered at — while the latter is inside the same. This 
in turn implies that the angle 62 <E [0, n] between the axis 
of the cone and the line joining any point y 2 on A2 and 
the origin is smaller than the angle 9\ € [0, 7r] between 
the axis and the line joining any point yi on A\ and the 
origin, as shown in the right hand side of Fig. [3] This in 
turn implies that yi is closer to than y2 is to Cfc, as 
shown below: 

||c fc - yi || 2 = NP + R 2 + 2Ryftm cos 6 1 
< NP Q + R 2 + 2R^NP COS t>2 

= Hc fc -y 2 || 2 . 



This in turn means that the integral of the density function 
of the Gaussian noise vector with center at over the 
volume 7Z* k is greater than the integral over the volume 
Cj|;. The former is the probability of the event E\ and the 
latter is the probability of the event E 2 - 

Appendix C 
Asymptotic exponential decay of the Q 
function with N 

Though it is hard to express Q in terms of elementary 
functions, it is easy to obtain asymptotic approximations 
when the block length N is very large. The idea is to use 
Shannon's computation of the sphere-packing exponent. 
Shannon derives a bound on <2(.) as a function of the 
cone angle 9 E [0,tt] instead of the solid angle £1 since this 
makes asymptotic analysis easier. This results in a bound 
for Q (.) that decays exponentially in N, with the exponent 
being 

E l ( e ) = p2-l\ ^GcosO- log (G sin0) , 
zap 1 y ct 

where 

The bound on Q(.) for a given can then be evaluated 
numerically or by any other means, since there is a one- 
to-one correspondence between the cone angle 8q and the 
solid angle £1 (see Fig. @): 

n(6 )= WA r +n — / (sin0o) d0 o . 
1 {-2-) J o 

The particular case of interest in J3] is Q (^jjQq, N, 5m, 

which corresponds to the cone angle 8 = sin -1 2~ R 
and the sphere-packing lower bound is obtained thus 
(see pages 620 and 625 in O). Our bound involves 
Q (^ M A J 1 flQ^ N, instead, and hence we will have to 

evaluate the exponent El (9) with 9 = tt — sin -1 2~ R 
instead, giving us the following result: 




Fig. 4: Relation between solid angle and cone angle. 

1 (W; W n ) < (l - e-^M^/^))" , (17) 
where E as (R, S) is as shown in ( f]~8b . 



